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Despite their conserved functional role in sexually reproducing 
organisms, spermatozoa are a diverse and rapidly evolving cell 
type. This phenomenon is largely attributed to sexual selection 
in polygamous species where sperm from multiple males 
compete to fertilize a limited number of oocytes. Drosophila 
have proven to be a particularly informative model system for 
the study of spermatogenesis and in this review we discuss 
how the characterization of the Drosophila melanogaster 
sperm proteome has advanced our understanding of the 
evolutionary genomics of sperm form and function. We 
summarize the molecular evolutionary characteristics of 
sperm genes and highlight recent evidence demonstrating 
the importance of novel gene creation in the evolution 
of sperm function and competitive ability. Comparative 
proteomic evidence is also provided, supporting an overall 
functional conservation between the Drosophila and mouse 
sperm proteomes. This analysis reveals a diverse repertoire of 
proteins functioning in proteolytic pathways, as well as the 
presence of proteins of the complement and innate immunity 
systems. We propose that these pathways may have functional 
relevance to post-mating female immunological responses as 
well as coevolved interactions with pathways expressed in the 
female reproductive tract, including those involved in sperm- 
oocyte recognition and fertilization. 



Introduction 

Spermatozoa are among the most morphologically diverse cell 
types despite their conserved role in delivering the paternal 
genetic contribution to the egg at fertilization. 1 The remark- 
able diversification of sperm is believed to be driven by sexual 
selection, where sperm are the focal point of the intense selective 
forces associated with sperm competition. The theory of sperm 
competition was first proposed by Geoff Parker 2 and has sub- 
sequently become a major discipline of evolutionary research. 
Drosophila have proven to be an extremely amenable model 
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system for the study of sperm competition and we now possess a 
remarkably advanced understanding of the selective forces asso- 
ciated with sperm competition 3 and morphological diversifica- 
tion of sperm within this group. 4 Drosophila also represent one 
of the most sophisticated model systems due to the development 
of advanced genetic tools, a carefully curated genome annota- 
tion and the availability of an enormous range of mutant strains. 
The use of these resources, in conjunction with complementary 
experimental studies of sperm, has resulted in an unparalleled 
understanding of the molecular and physiological impact of 
mating, 5 as well as sperm behavior in the female reproductive 
tract. 6 In Drosophila, spermatogenesis proceeds in an essentially 
linear manner from the apical to distal end of the testis, with 
distinct testis regions enriched for cells in mitotic, meiotic and 
post-meiotic stages of spermatogenesis. It has therefore been pos- 
sible to isolate sperm from these regions by dissection to analyze 
gene expression at these primary timepoints during this care- 
fully regulated differentiation process. 7 

Despite the utility of Drosophila as a model for the study of 
spermatogenesis, an incomplete understanding of the evolution- 
ary genomics of male reproduction persisted for some time due 
to a paucity of information regarding the molecular composition 
of sperm. This lack of knowledge can be largely attributed to the 
progressive silencing of transcription as the genome is repackaged 
and compacted during spermatogenesis, 8 which limits the util- 
ity of traditional methods of gene expression quantification as a 
means of inferring whether a resultant protein is in fact an integral 
component of mature sperm. Indirect inferences of sperm pro- 
tein composition is further complicated by the fact that the testis 
transcriptome is extraordinarily diverse' making gene expression 
profiles of the testis relatively uninformative. This gap in our 
knowledge base has largely been rectified by rapid advances in 
the field of mass spectrometry (MS) and the subsequent applica- 
tion of proteomic approaches to the study of sperm composition 10 
and reproduction more generally (reviewed in ref. 11). We now 
know that testis gene expression is a poor predictor of whether or 
not the protein product of a given gene is an integral sperm com- 
ponent. 12 Prior to the initial characterization of the Drosophila 
melanogaster sperm proteome (DmSP), 13 only 16 genes had char- 
acterized phenotypes specifically influencing sperm morphology 
or function, of which only 5 had been empirically demonstrated 
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D. melanogaster sperm proteome 



Figure 1. Molecular evolutionary analysis of the Drosophila melano- 
gaster sperm proteome (DmSP). Average nonsynonymous substitution 
rates (± s.e.m.) between D. melanogaster and D. simulans are displayed 
for the DmSP-l (bold), individual functional categories of DmSP genes, 
nonreproductive genes, accessory gland transcripts and ACP genes. 
Substitution rates for accessory gland and ACP genes is based on Swan- 
son et al. PNAS. The figure has been adapted from Dorus et al., 2006. 



to encode protein components in mature sperm. A recent reanal- 
ysis of the D. melanogaster sperm proteome resulted in the robust 
identification of more than 1100 proteins which encode compo- 
nents of the mature spermatozoa. 12 

In this review we focus specifically on how the characteriza- 
tion of the sperm proteome has advanced our understanding of 
the evolutionary and functional genomics of reproduction in 
Drosophila. Wherever possible, we highlight the way in which 
a proteomics based perspective of sperm is being complemented 
by the diverse array of molecular and genetic approaches cur- 
rently being used to understand sperm form and function. We 
begin by providing a summary of the evolutionary genomics of 
the sperm proteome in the context of the rapid evolution of many 
reproductive genes and the more recent conclusions concerning 
the heterogeneity of selective forces operating across reproduc- 
tive systems. We follow with evidence supporting the prominent 
role of gene creation in sperm evolution, citing recent examples 
of novel sperm genes that influence sperm competitive ability. 
We then discuss the results of a comparative analysis of the D. 
melanogaster and Mus musculus (mouse) sperm proteomes, which 
illustrates the overall functional and compositional conserva- 
tion of this cell type. This analysis also reveals the diversity of 
D. melanogaster sperm immunity and proteolytic proteins and 
we discuss their potential role in mediating post-mating female 
immunity responses and coevolutionary interactions with the 
abundant proteolytic proteins present in the female reproduc- 
tive tract. Ultimately, the discussion of sperm proteome research 
highlights the multifaceted insights that can be inferred from 
this unique cell type and how the proteome continues to serve as 



a complementary resource for genetic and molecular investiga- 
tions into sperm function. 

Molecular Evolution of the Sperm Proteome 

Despite widespread historical interest in sperm competition in 
insects and the rapid advancement of molecular studies of acces- 
sory gland proteins in Drosophila, 14 little was known about the 
molecular composition of Drosophila sperm prior to the applica- 
tion of MS to the study of the sperm proteome. Molecular evo- 
lutionary studies of sperm proteins were restricted to a relatively 
small number of examples given the obstacles to empirical charac- 
terization discussed previously. The initial characterization of the 
Drosophila melanogaster sperm proteome 13 (termed the DmSP-I) 
represented the first whole-cell catalog of integral sperm proteins 
and resulted in the first "whole-cell" evolutionary genomic analy- 
sis in eukaryotes. Reanalysis of the proteome, using improved 
methodologies, has resulted in substantial expansion of the pro- 
teome and a significant increase in the depth of coverage. 12 The 
expanded sperm proteome (termed the DmSP-II) represents the 
most comprehensive database of insect sperm proteins for evo- 
lutionary genomic analyses and in this section we highlight the 
main observations stemming from its characterization. 

Evolution of D. melanogaster sperm proteome (DmSP-I). 
The original molecular evolutionary analysis of the DmSP aimed 
to asses the selective forces acting upon the proteome as a whole. 13 
This was achieved using estimates of nonsynonymous (dN) and 
synonymous (dS) rates of divergence for 342 orthologous genes 
between D. melanogaster and its closely related sister taxa, D. 
simulans. Contrary to many studies demonstrating the rapid 
evolution of reproductive genes (reviewed in ref. 15), this analy- 
sis revealed a high level of functional constraint (as measured 
by dN/dS ratio) upon the proteome relative to genes expressed 
in the accessory gland (Fig. 1) and a complete absence of genes 
showing evidence of positive selection (as measured by a dN/dS 
ratio that significantly exceeded 1.0). Thus, it was concluded that 
the sperm proteome, as a whole, is evolving quite conservatively, 
presumably under the influence of functional and structural con- 
straints. Consistent with this conclusion was the pronounced 
levels of functional constraint acting upon genes with cytoskel- 
etal, central metabolic and energetic functions (Fig. 1). It is also 
noteworthy that the sperm proteome is significantly enriched 
for genes within these functional categories, many of which 
are widely expressed within a range of cell types and are thus 
likely to experience pleiotropic functional constraint. 12 Despite 
the absence of any evidence of positive selection, it was revealed 
that DNA/RNA binding factors and genes with uncharacterized 
functions were evolving more rapidly, at a rate statistically indis- 
tinguishable from accessory gland genes. Consistent with this 
observation is the subsequent finding that positive selection is 
likely to have influenced the evolution of Drosophila protamines 
(Mst35Ba and Mst35Bb) which have undergone a D. melanogaster 
specific gene duplication event (see discussion below). 16 

Evolution of the expanded Drosophila sperm proteome 
(DmSP-II). The reanalysis of the DmSP using improved prefrac- 
tionation and MS techniques resulted in a three-fold increase in 



214 



Spermatogenesis 



Volume 2 Issue 3 



the number of empirically characterized proteins (1108 in total). 
In addition, the genes encoding the DmSP-II have now been ana- 
lyzed using maximum likelihood evolutionary models, facilitated 
by the recent availability of 12 genomes across the Sophophora 
subgenus. 17 The findings of this analysis are generally consistent 
with the DmSP-I study and confirm that (1) purifying selection 
is the predominant force acting on the sperm proteome and (2) 
the evolution of the sperm proteome is quite distinct from the 
evolution of genes expressed in the accessory gland (details of 
this analysis are reviewed in ref. 18). One notable exception is 
that the use of more sensitive site-based models revealed the sig- 
nature of positive selection on 8% (77 in total) of the sperm genes 
analyzed. A rather limited number of these genes are testis-spe- 
cific in expression (24%) and few have established functions in 
sperm, thus restricting conclusions about the relative importance 
of sexual selection on these genes. One interesting example of a 
positively selected gene with a well-established sperm function 
is the sperm cation channel, Pkd2, which is implicated in sperm 
competition through the control of the directional motility. 19 
The abundance of genes with uncharacterized functions among 
these positively selected genes, and within the sperm proteome as 
a whole, highlights the potential for targeted molecular studies to 
establish roles in sperm that may influence competitive fertiliza- 
tion ability. 

The expanded coverage of the DmSP-II was also extremely 
informative, as a greatly increased number of genes influencing 
sperm function and development were identified. 12 In fact, a sub- 
stantial proportion of these genes have sperm development phe- 
notypes, arising either during spermatid development or sperm 
individualization, and are likely to have pronounced affects on 
sperm function. In total, 40 genes were known from either classi- 
cal or molecular genetic studies to directly affect sperm develop- 
ment or function with the majority functionally related to sperm 
motility. This includes well-characterized sperm structural pro- 
teins such as (3Tub85D, (3Tub56D and oTub84B, 20 - 23 as well as 
a range of proteins that bind microtubules or have functions in 
microtubule motor activity. Of particular interest among these 
are the Y-linked fertility factors, kl-3 and kl-5, 24 ' 26 which encode 
dynein heavy chains essential to the axoneme outer arm struc- 
ture. 24 " 26 Despite the heterochromatic nature of the Y chromo- 
some, Y-linked genetic variation has been demonstrated to impact 
testis-specific gene expression 27 and it is important to determine 
if fertility factor variation may impact sperm characteristics. 

Another informative observation stemming from the analy- 
sis of the DmSP-II was the confirmation that many sperm genes 
are widely expressed across non-reproductive cell types, and 
that approximately 25% (271/1108) were expressed at least two- 
fold higher in non-reproductive tissues than in the testis. An 
inverse correlation between expression breadth and evolution- 
ary rates has been well-documented and is thought to be caused 
by increasing pleiotropic constraints. 28 Given the enrichment of 
widely expressed structural, central metabolic and energetic pro- 
teins within the DmSP, pleiotropic constraints acting on some 
sperm genes appears likely to explain the evolutionary rate het- 
erogeneity observed between Drosophila sperm genes and other 
male reproductive genes. It is also noteworthy that evolutionary 



analyses of the sperm proteome have focused on the melanogaster 
and obscura groups and have not yet been expanded to include 
more recently available genomic resources in species exhibiting 
sperm gigantism. Additional analyses are required to determine if 
accelerated evolution of the sperm proteome may be concentrated 
on lineages where the phenotypic evolution of sperm has been 
most pronounced. 

Gene Creation and the Functional Evolution 
of Drosophila Sperm 

Duplication of existing genetic material is an essential process 
in the evolution of genome complexity and the creation of genes 
with novel functions. 2 '' 30 Studies of novel genes have been con- 
ducted in a wide range of taxa revealing that many gene dupli- 
cates have acquired novel functions and that this process may 
occur in association with positive selection (reviewed by ref. 31). 
Of relevance is the observation that many novel genes acquire 
testis gene expression, an observation which is particularly com- 
mon among new genes created via retrotransposition, a mecha- 
nism whereby a processed mRNA molecule is reverse transcribed 
and reinserted into the genome. 32 Despite the demonstration that 
novel genes frequently acquire testis expression, the relevance 
of this process to sperm evolution and function was largely 
unknown prior to the characterization of the sperm proteome. 
Comparative genomic analyses of the DmSP-I revealed that 
gene creation, by both DNA- and RNA-based mechanisms, has 
played a prominent role in the evolution of sperm composition. 16 
In the following section, we review the influence of gene creation 
on the evolution of the sperm proteome with an emphasis on 
recent observations regarding the relevance of gene duplication to 
the process of sperm competition and possibly the evolutionary 
diversification of sperm morphology. 

Protamine evolution in D. melanogaster. Tandem duplica- 
tion of genes is a common phenomenon and there are several 
characterized examples of such duplicates which encode integral 
sperm components. 16 One notable example is the lineage-specific 
duplication of protamines {Mst35Ba and Mst35Bb) in D. mela- 
nogaster} 6 Protamines are highly basic molecules responsible for 
replacing histones during genome condensation and repackag- 
ing. 33 Molecular evolutionary analysis in Drosophila revealed 
the signature of positive selection on these gene duplicates dur- 
ing their recent divergence with a concentration of evolution- 
ary changes within the DNA binding high-mobility group box 
domain (HMG). While the functional implications of this 
observation remain unclear, it is interesting to speculate that this 
divergence may alter the temporal or spatial localization of these 
molecules during genome repackaging. Protamines are believed 
to be influenced by sexual selection in both primates 34 and mice, 
and it has been suggested that protamine evolution in mice 
might influence head hydrodynamics and therefore motility. 35 
However, unlike mammals, Drosophila protamine mutants are 
not haploinsufficient and fertility is maintained in double homo- 
zygote knockout strains, although nuclear morphology is often 
abnormal. 36 The non-essential nature of Drosophila protamines 
is intriguing in light of their recent duplication and targeted 
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studies will be required to determine whether this has influenced 
sperm chromatin characteristics relevant to sperm competition 
and fertilization. 

The origination of X-linked sperm gene clusters. In this 
section we summarize evidence supporting the evolution of 
novel X-linked sperm gene clusters and their role in male fer- 
tility. X-linked sperm gene clusters are somewhat unexpected 
in light of the extensive trafficking of male-based genes off the 
X chromosome, as well as a paucity of X-linked testis-expressed 
genes in both Drosophila 32,37 and mammals. 38,39 These observa- 
tions have been attributed to two factors. The first is meiotic sex 
chromosome inactivation (MSCI) whereby the X chromosome 
is progressively silenced during spermatogenesis, thus restricting 
X-linked expression during later stages of spermatogenesis. While 
MSCI has been clearly documented in mammals, 40 its presence in 
Drosophila remains controversial (see ref. 7 but also refs. 41 and 
42). The second is the influence of sexually antagonistic selection 
upon X-linked male-biased genes, ultimately favoring the trans- 
fer of male-biased functions to the autosomes. 43,44 Regardless of 
the ultimate cause, the presence of new and expanding X-linked 
sperm gene clusters is unusual and may provide insights into the 
role of the X chromosome in male fertility. 

The most comprehensive characterization of novel sperm 
gene creation is the Sperm dynein intermediate chain (Sdic) gene 
family, which arose in the D. melanogaster lineage within the 
last 5.4 million years. Sdic is the product of a chimeric gene 
fusion of Annexin X (AnnX) and short wing (sw) that has subse- 
quently been duplicated into a tandem array of 8-10 copies. 45,46 
Sdicl is testis-specific and its protein product has been visual- 
ized in maturing spermatocytes, and the flagellum of mature 
sperm using a SdicwGYP fusion protein under control of the 
Sdic promoter. It is noteworthy that Sdic, for reasons that are 
yet unclear, has not been identified in the proteomic analysis of 
D. melanogaster sperm 12,13 despite the fact it has been visualized 
in mature sperm. Based on its subcellular localization, the well- 
characterized structural role of dyneins in the sperm axoneme 
and a population genetic signature of recent selective sweeps, it 
has been proposed that Sdic is involved in sperm motility and, 
therefore, may be impacted by sexual selection associated with 
sperm competition. 47 This proposition is supported by the recent 
analysis of strains harboring Sdic deletions which found a sig- 
nificant decrease in sperm competitive ability in comparison to 
wild-type sperm. 46 Thus, strong support exists for an evolution- 
arily acquired role for the X-linked Sdic gene cluster in sperm 
function that has direct relevance to sperm competition. 

A second example, with clear similarities to the evolution of 
the Sdic gene cluster, is the recent evolutionary expansion of the 
X-linked tektin gene cluster. 13 Tektins are cytoskeletal proteins 
that directly interact with microtubules in the sperm axoneme 48 
and a total of five tektins have been identified in the DmSP. 
Three tektins {CG17450, CG32820 and CG32819) reside in an 
X-linked cluster which originated very recently on the D. mela- 
nogaster lineage through sequential tandem duplication events. A 
recent analysis of the relationship of X-linked genetic variation 
with sperm competitive ability identified three distinct muta- 
tions in CG17450 which influence reproductive phenotypes in 



competitive mating experiments. 49 This finding provides strong 
support for the functional importance of the tektin gene clus- 
ter in sperm function and highlights the importance of the 
X-chromosome variation in male fertility. 

The characterization of two recently expanded structural 
sperm gene clusters on the X chromosome suggests that these 
findings may apply to other X-linked genes. For example, a recent 
study demonstrated that a testis-specific, tandemly duplicated 
pair of X-linked polyubiquitin genes also influence male fecun- 
dity. 50 As stated previously, these observations do not necessarily 
conform to current models regarding the genomic distribution 
of male-biased genes. Furthermore, they may also suggest that 
MSCI, if it exists in Drosophila, may have regionally distinct 
properties across the X chromosome that favor the creation of 
co-localized copies of genes in certain locations whose expression 
is required during later stages of spermatogenesis. Lastly, this pat- 
tern of spatial clustering is reminiscent of ampliconic regions on 
the mouse X chromosome that harbor postmeiotically expressed 
testis genes. 51 Understanding the manner in which the X chromo- 
some harbors genetic variation associated with male fertility may 
depend on understanding the selective forces responsible for the 
heterogeneous physical distribution of spermatogenesis genes on 
the X chromosome. 

Novel sperm retrogenes. Analysis of the DmSP revealed that 
a number of retrogenes encode integral sperm components 16 and 
the importance of retrotransposition, as a generator of func- 
tional genetic diversity, has been confirmed in a recent survey 
of retrogenes in the D. melanogaster genome (Dorus and Rettie, 
unpublished data). This analysis revealed that 19.8% (20 out 
of 101) encode integral sperm components (Table 1) and that 
75% of these (15 out of 20) function in specific mitochondrial 
metabolic pathways, including the tricarboxlic acid cycle, elec- 
tron transport and oxidative phosphorylation. An abundance 
of mitochondrial gene duplicates encoding sperm proteins has 
previously been attributed to selection associated with motility 
based energetic requirements. 44 However, several lines of evi- 
dence suggest that insect sperm motility is largely dependent on 
glycolysis generated ATP. The role of insect sperm mitochondrial 
derivatives as energy-producing organelles in mature sperm has 
thus been called into question (reviewed in ref. 52). Although it 
is possible that motility is enhanced by residual ATP generated 
by the nebenkern (the fused giant mitochondria that restructure 
in parallel with the developing axoneme), recent work demon- 
strates that mitochondrial processes in Drosophila sperm are 
closely linked with flagellum development. Importantly, the 
association between microtubule dynamics and nebenkern elon- 
gation is structurally essential for uniaxial spermatid tail devel- 
opment and the disruption of mitochondria-microtubule linkage 
results in developmental defects of the flagellum. 53 Based on the 
evidence that mitochondrial derived ATP is unlikely to be the 
predominant energetic source for motility, we propose that the 
selective retention of metabolic sperm retrogenes in D. melano- 
gaster may be associated with their role in the nebenkern as a 
specialized, energy producing organizing center for microtubules 
during sperm flagellum elongation. The enhancement of these 
mitochondrial pathways required for flagellum development may 
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Table 1.0. melanogaster sperm retrogenes 
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thus be associated with the remarkable diversification of flagel- 
lum length in these taxa. 4 This hypothesis is also consistent with 
the observation that numerous cytoskeletal flagellum compo- 
nents have also originated recently during D. melanogaster sperm 
evolution, including (1) retrogene Cdlc2, whose parent, Ctp (also 
known as Ddlcl) is an intermediate dynein binding protein 
required for actin filament assembly in the developing flagellum 54 
and (2) the previously discussed X-linked Sdic (sperm dynein) 
and tektin gene clusters. Although further investigation will be 
required to establish the specific functions of these novel sperm 
genes, the evolutionary timing of their origination is intriguing 
in light of the dramatic sperm flagellum length diversification 
during Drosophilidae evolution. 

Expansion of the sperm leucyl-aminopeptidase gene fam- 
ily (S-LAPs). Comparative genomic analysis in Drosophila has 
revealed that gene families with functions in reproduction evolve 
rapidly through the creation and attrition of family members. 55 
An excellent example of this is the S-LAP gene family that has 
experienced a dramatic expansion during Drosophila evolution. 
This gene family is of great functional interest because it is tes- 
tis-specific and encodes the most abundant proteins, by mass, 
in Drosophila sperm. 56 Due to the age of the S-LAPs the exact 
mechanisms and sequence of duplication events resulting in the 
family are unclear. However, the intron/exon pattern and the 
occurrence of genes in tandem suggest that there have been two 
cases of tandem duplications (S-LAP1 and S-LAP2; S-LAP3 and 
S-LAP4) and S-LAP7, that has been characterized as a retrogene. 



Perhaps the most unusual aspect of S-LAP evolution is that the 
most dramatic period of gene family expansion occurred after the 
ancestral gene underwent a series of amino acid substitutions that 
radically altered critical residues within the catalytic site, most 
likely abolishing enzymatic activity. Therefore, it appears that the 
S-LAP gene family has evolved a novel, but yet to be determined, 
sperm specific function and that this neofunctionalization may 
have been selectively involved in the retention of newly created 
S-LAP gene copies during early Drosophila evolution. The loss 
of catalytic activity among the S-LAP family is reminiscent of 
the loss of enzymatic activity and acquisition of structural roles 
in other ubiquitously expressed metabolic enzymes, such as the 
crystallin family of lens proteins. We have therefore proposed 
that the selective retention of enzymatically inactive S-LAPs may 
be related to the establishment of structural functionality, which 
may in turn be associated with sperm morphological diversifica- 
tion in these taxa. 56 

Therefore, novel gene creation has thus been an influential 
force in the evolution of the sperm proteome and recent analyses 
have begun to identify links between these loci and sperm func- 
tion. A precise understanding of the functional ramifications of 
this process still requires additional, targeted molecular studies. 
Nonetheless, patterns of novel gene creation in Drosophila sug- 
gest that the enhancement and expansion of both cytoskeletal 
and mitochondrial energetic systems may be critical to the mor- 
phological diversification of Drosophila sperm, as well as sperm 
competitive ability. In addition, the physical distribution and 
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Figure 2. Functional comparison of the Drosophila and mouse sperm proteomes. Gene ontology information was obtained from the PANTHER 
database (http://www.pantherdb.org/) and genes without annotated functions are labeled as uncharacterized. Proportions are based upon the total 
number of annotated functions, permitting genes to be present in multiple functional categories. Significant differences in the number of genes in 
Drosophila melanogaster and Mus musculus sperm proteomes for each functional category was determined using a two-tailed Fisher's Exact Test with 
Bonferroni correction for multiple testing (indicates overrepresentation in the mouse sperm proteome; t indicates overrepresentation in Drosophila 
sperm proteome). 



localized expansion of X-linked sperm gene families may provide 
useful insights into how selection has shaped genetic variation on 
the X chromosome associated with male fertility. 

Comparative Sperm Proteomics and Genomics 

Whole-cell comparative sperm proteomic analyses are now pos- 
sible with the application of MS based proteomics to sperm from a 
range of species, including Drosophila and an increasing number 
of mammalian taxa (reviewed by ref. 11). This approach, in con- 
junction with comparative genomics, has already provided novel 
insights into the conserved, as well as highly diverged, molecu- 
lar aspects of sperm function. For example, the comparison of 
D. melanogaster and the mouse sperm proteomes has revealed that 
greater than 40% of mouse axoneme proteins have homology to 
the DmSP, 13 and that greater than 65% of DmSP genes possess a 
mouse homolog, of which 20% have also been identified within 
the mouse sperm proteome. 12 In this section, we reanalyze conser- 
vation between the Drosophila and mouse sperm proteome and 
discuss functional parallels between the two, including the abun- 
dance of immunity and proteolytic proteins within the DmSP. 

Conservation of sperm proteome functional composition. 
Next to D. melanogaster, the mouse has the most comprehen- 
sively characterized sperm proteome as it has been interrogated 
using a variety of approaches. These include (1) whole-cell shot- 
gun proteomics, 57,58 (2) proteomic analysis of membrane enriched 



samples and cell surface proteins 55 and (3) 2D-gel analysis of 
biochemically purified accessory structures of the axoneme 60 and 
sperm membrane lipid rafts. 61 To assess conservation of the func- 
tional composition of sperm, we have compared the combined 
Mus musculus sperm proteome (MmSP) from these studies with 
the DmSP utilizing PANTHER functional categories (Fig. 2). 
Statistical analysis of the functional composition of the pro- 
teomes reveals substantial similarities in the functional composi- 
tion of insect and mammalian sperm proteomes with only 3 out 
of 18 functional categories identified as being statistically distin- 
guishable in their abundance between the two. These include a 
significantly higher proportion of uncharacterized genes in the 
DmSP, consistent with the higher proportion of uncharacterized 
genes in the Drosophila genome as a whole, and a significantly 
higher proportion of immunity and response to stimuli anno- 
tated proteins observed within the MmSP. The relative underrep- 
resentation of immunity proteins in the DmSP is attributable in 
part to the absence of adaptive immunity in insects, whereas pro- 
teins functioning in adaptive immunity contribute substantially 
to the total repertoire of immunity pathways present in mamma- 
lian sperm (reviewed by ref. 62). As such, with the exception of 
immunity and stimulus response processes, the sperm proteome 
of insects and mammals show a high level of evolutionary conser- 
vation at the level of their functional composition. 

Immunity proteins in the DmSP. The evolutionary trade- 
offs between reproduction and immunity have been investigated 
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Table 2. DmSP genes with functions in the immunity pathways 

Gene symbol 



GO accession 



GO term 



Innate immune response and Toll pathway 



Toll-4,' Tollo 1 
Galpha49B,' norpA' 
psidin 2 
Nos' 

Dscam,' Dscam2, Dscam3, Hel89B," Spz3 



45087, 45089 

02385 
02253, 06959 

06952 

08063 



Innate immune response 
Mucosal immune response 
Humoral Immune response 
Defense response 
Toll signaling pathway 



Antimicrobial functions 



Ms, 1 Ma,' mask,' Hel89B," Dscam; Pvr,' par-t; Tepli; TepIV, alphaTub84B'- : 
Dscam,' Tepll' 
Dscam' 
He\89B™ Dscam' 
Vps4' 



19730, 19731,08348 
42742, 50829, 50830 
61057,42834, 16019 
02805, 06963 
44130 



Antimicrobial humoral response 
Defense response to bacterium 
Peptidoglycan recognition/binding 
Regulation of AMPs 
Regulation of symbiont growth 



Antiviral functions 



Dscam,' AtgT 
form3' 

'Testis expression based on transcript presence in 4 out of 4 microarray experiments.' 
2 Upregulation of expression in the testis relative to whole fly. 68 
3 Male sterile alleles have been characterized for these genes. 



50688 Defense response to virus 

19064 Viral envelope fusion with host membrane 



extensively and female immunological responses to mating, 
based on changes in the expression of female immunity genes, 
have been well characterized in Drosphila. 63 " 67 Despite variation 
in experimental approaches, antimicrobial genes appear to be 
consistently upregulated, while innate immunity genes within 
the Toll and IMD pathways are more variable in their response. 67 
Female survival against infection also decreases in the first day 
after mating, although it is not known how regulatory responses 
in immunity pathways may ultimately be associated with this 
phenomenon. 65 Interestingly, experimental approaches designed 
to disentangle the relative regulatory effects of sperm and acces- 
sory gland proteins (ACPs) determined that ACPs exclusively 
upregulate antimicrobial peptides (AMPs) (including members 
of the Attacin and Cecropin gene families), while sperm impact 
the expression of a more widespread set of genes across a range 
of functional classes, including the downregulation of certain 
key immune response genes. 64 Therefore sperm, in the absence of 
ACPs, have the capacity to mediate female immunity responses, 
although little attention has been focused specifically on how 
components of the Drosophila sperm proteome may function in 
this manner. 

A great deal more is known about the function of immunity 
proteins in mammalian sperm and proteomic studies have been 
useful in revealing the full diversity of such pathways, which 
include a diverse repertoire of complement, adaptive immunity 
and antimicrobial proteins (reviewed in ref. 62). Functional 
characterization of immunity proteins in mammalian sperm also 
indicates that some have acquired sperm-specific functions dur- 
ing their evolution. For example, acrosome associated comple- 
ment regulator CD46 influences acrosome reaction dynamics 69 
and its presence in sperm is believed to be related to variation 
in the intensity of sperm competition across species. 70,71 Another 
notable example is DEFENSIN-|3 126 (DEFB126), an AMP 



which modulates sperm penetration of the cervical mucus in 
the female reproductive tract. 72 Common mutations in human 
DEFB126 have been shown to result in subfertility. 73 Sperm are 
coated with DEFB126 in the epididymis, and immunological 
studies have revealed that DEFB126 inhibits female immune rec- 
ognition and thus acts as a cloaking device while sperm traverse 
the female reproductive tract. 74 It is noteworthy that this process 
is quite reminiscent of the coating of Drosophila sperm by acces- 
sory gland protein Sex-peptide (SP), a protein which also influ- 
ences female immune response. 75 The widespread involvement 
of immunity pathways in mammalian sperm function raises the 
question of whether parallel functional systems may be present 
in Drosophila. 

To explore this possibility, we have surveyed the DmSP for all 
immunity-related proteins present in mature sperm. This analysis 
identified a total of 20 proteins functioning across three general 
categories: innate immunity response, antimicrobial function 
and antiviral function (Table 2). An additional 26 genes have 
annotated functions in phagocytosis and three are involved in 
hemocyte development or function. Of interest is the identifica- 
tion of Dscam 1, 2 and 3 in the DmSP, which are members of 
the Immunoglobulin-superfamily. Dscam genes have an unusual 
genomic structure containing thousands of exons that can be 
variably spliced, resulting in the potential translation of more 
than 18,000 protein isoforms. 76 This phenomenon, although 
mechanistically distinct, has parallels to somatic diversification 
of immune receptors in the adaptive immune systems of other 
taxa. Dscam 1 has been studied intensively in relation to its role 
in immunity 77 and neurogenesis, 76 but a function in reproduction 
has yet to be characterized. Consistent with the composition of 
mammalian sperm, the DmSP contains Tepll and TepIV, which 
are opsonin thioester-containing proteins homologous to mam- 
malian complement proteins. Although neither have identified 
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Table 3. DmSP genes with peptidase or proteinase inhibitor domains 



Peptidase domain 


Gene symbols 


Metallopeptidases 


Peptidase Ml 


L\jdo49, rsa, lOTZ 


Peptidase M12B 


Ppn, stl,' Tace 2 


Peptidase M13 


CG15485, CG31918, CG5527, Nep2,' Nep3, Nep5 2 


Peptidase M14 


CG17633, NnaD,' east' 


Peptidase M16 


CG10588, 2 CG373V CG4169, CG8728 


Peptidase M17 


Dip-B,' S-Lap1-5, 2 S-Lap6 {loopin-1)? S-Lap7-8 2 


Peptidase M24 or M24A 


dre4, und 2 


Peptidase M41 


CG2658, 2 CG6512 2 


Serine peptidases 


Chemotrypsin (S1A or S1/S6) 


CG32269, CG34350, CG4927, CG5390,' Conn, CG1632' 


Non-chemotrypsin (S8, S9, S10, S14,S16, S26A, S28) 


Furl,' CG17684 2 CG9059, CG4572,' CG5045, 2 CG8798, CGI 3991, 2 CG9240, 2 CG3734 2 


Other peptidases 


Peptidase A2A 


rngo 2 


Peptidse C19 


CG12082,' CG30421,' CG5794,' CG8830 


Peptidase C48 


CG10107 2 


Proteinase inhibitor 


Proteinase inhibitor 12 (Kunitz) 


axo, Ppn 


Proteinase Inhibitor 11 (Kazal) 


CG13830,CG7173,magu' 


Proteinase inhibitor 17 (squash) 


dp 


Serpin domain 


Spn43Ab, Spn47C 2 



'Testis expression based on transcript presence in 4 out of 4 microarray experiments.' 
2 Upregulation of expression in the testis relative to whole fly. 68 



functions in reproduction, they were also recently found to be 
upregulated in the female following mating. 67 DmSP constitu- 
ents were also identified across a diverse range of antimicrobial 
functions, including peptidoglycan recognition and binding, 
regulation of AMPs, regulation of symbiont growth and antimi- 
crobial humoral response. Notable among these are Hel89B, an 
ATP dependent DNA helicase, and alphaTub84B, a cytoskeletal 
structural protein, which are implicated in sperm function based 
on the identification of male sterile alleles. 

Innate immunity is dependent on the Toll and IMD path- 
ways and our survey reveals that proteins in the Toll pathway, 
but not IMD, are present in D. melanogaster sperm, including 
Toll-4, Tollo (Toll-8), spz3 and Hel89B. Activation of the Toll 
pathway occurs via the cleavage of spdtzle, which binds to the 
Toll receptor, by a serine protease cascade resulting in the activa- 
tion of the Toll receptor; this in turn leads to a cascade of protein 
interactions, culminating in the transcription of antimicrobial 
peptides. It is therefore plausible that Toll-4, Tollo and spz3 may 
be interacting proteins in sperm, if the Toll pathway in sperm 
operates in a manner consistent with its operation in immunity. 
In this context, it is interesting to note that both Toll-4 and spz3 
are recently evolved genes and may therefore have male-specific 
functions which distinguish them from stereotypical members 
of the Toll pathway. Interestingly, the elicitation of AMP genes 
by the seminal peptide, SP, is also dependent on the Toll path- 
way. 75 Although the Toll-pathway is dependent upon serine pro- 
tease activity, the functional significance of these proteins (and 



immunity proteins more generally) in the DmSP remains unclear. 
Given the abundance of serine protease pathways in both mam- 
malian and Drosophila reproductive systems it is plausible that 
their presence may be related to regulation of the Toll pathway. 

Enrichment of protease and protease inhibitors in the 
sperm proteome. Regulators of proteolysis are functionally rel- 
evant to sperm biology because of their ubiquitous abundance 
in both the male and female reproductive tract 14,78,79 and dem- 
onstrated ability to regulate the activity of other reproductive 
proteins, such as SP. 75,80 Proteases and peptidases have also been 
found to be significantly enriched within the DmSP, 12,13 which 
includes the highly abundant proteins encoded by the recently 
expanded S-LAP gene family. 57 A survey of the DmSP identifies 
a total of 60 proteins with peptidase domains (primarily metal- 
lopeptidases and serine peptidases), including 8 proteins with 
proteinase inhibitory domains (Table 3). Despite the fact that a 
number of these are over- or specifically expressed in the testis, 
none have documented functions in sperm function or fertility. 
Among the metalloproteases, stl and Tace possess the Peptidase 
M12B and ADAM/reprolysin domains found in the mammalian 
A-disintegrin and metalloproteinase {Adam) gene family. This is 
of interest as Adam proteins are abundant in the mouse sperm 
acrosome (8 are present in the MmSP) and function in sperm- 
egg recognition and fusion (reviewed by ref. 81). These may be 
excellent candidates for further functional characterization as 
relatively little is known about the molecular basis of Drosophila 
sperm-egg recognition. 
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Sexually antagonistic coevolution has been invoked to explain 
the rapid evolution of male and female proteins associated with 
sperm competition, storage and utilization. 79 One possible hall- 
mark of such a coevolving system is the involvement of unique 
male and female genes that can respond efficiently to male- and 
female-specific selective forces. If this is the case, one might pre- 
dict that male and female contributed proteolytic proteins might 
be generally distinct despite the requirement that they function 
in closely related pathways. Consistent with this prediction is 
the observation that the repertoire of serine proteases present 
within the DmSP (Table 3) is encoded by a completely distinct 
set of genes than the serine proteases highly expressed in the 
two primary sperm storage organs in Drosophila, the sperma- 
theca and seminal receptacles. 75,80 Furthermore, the presence of 
serine protease inhibitors (Serpins) within the DmSP could also 
be indicative of coevolved male-female interactions and it would 
be informative to determine if these Serpins do in fact regulate 
serine proteases in the female reproductive tract. To this end, it 
would be useful to determine if Serpins are localized to the sperm 
cell membrane where they would be accessible to direct molecu- 
lar interactions. Although little is known about their reproduc- 
tive functions, targeted proteomic analysis of the mouse sperm 
cell membrane and acrosome have identified a remarkable diverse 
family of Serpins, 59 of which several show evidence of rapid evo- 
lution or positive selection. 58 The shared presence of Serpins in 
Drosophila and mouse sperm not only supports their functional 
relevance to sperm function but also suggests that similar path- 
ways (e.g., serine proteases and their inhibitors) may be gener- 
ally involved in coevolved male-female molecular interactions 
relevant to sperm competition and fertilization. 

Conclusions and Future Perspectives 

Sexual selection, as a primary selective force in evolution, was 
first discussed by Charles Darwin 82 and has been a major focus 
of both theoretical and empirical investigations by evolutionary 
biologists ever since. The amount of phenotypic data relevant to 
the evolution of reproductive traits is tremendous, yet despite the 
advancement of relevant genetic theories, a nuanced understand- 
ing of the evolutionary relationship between genotype and phe- 
notype in reproductive systems has yet to be achieved. Molecular 
evolutionary analyses of reproductive genes has revealed wide- 
spread evidence of rapid evolution, although few examples have 
been documented where the underlying selective forces are 
well understood. Sperm may ultimately be an ideal system for 
exploring how sexual selection operates on genotype-phenotype 
relationships because they are amenable to experimental manip- 
ulation and we possess well-developed theoretical predictions 
concerning the impact and outcomes of sperm competition. In 
this review, we have thus set out to describe our current under- 
standing of the D. melanogaster sperm proteome and how it has 



informed our understanding of sexual selection and the evolu- 
tionary genomics of sperm. 

Characterization of the D. melanogaster sperm proteome has 
provided the necessary knowledge base for investigating the 
genetic basis of sperm evolution. These analyses identified an 
unexpected level of functional constraint upon the proteome as a 
whole and also revealed significant evolutionary rate heterogene- 
ity within male reproductive systems. Ultimately, a more detailed 
understanding of compartmentalized adaptation in response to 
sexual selection has been achieved through additional studies in 
both mammals and insects. 58,83 Analysis of the sperm proteome 
has also revealed that gene creation is a prominent mechanism 
in the evolutionary diversification of the sperm proteome. More 
significant perhaps is the observation that the selective reten- 
tion of novel sperm genes appears to be non-random, as novel 
Drosophila genes are enriched within mitochondrial energetic 
pathways and proteins functioning in the development and struc- 
ture of the sperm flagellum. Although the ultimate ramifications 
of this trend have yet to be determined, recent progress has been 
made in associating sperm competitive ability with genetic varia- 
tion in some of these novel sperm components. We anticipate 
that the sperm proteome will be a valuable asset as more studies 
attempt to elucidate the association between intra- and interspe- 
cific genetic variation and a range of sperm phenotypes, includ- 
ing the independent origination of sperm gigantism in multiple 
Drosophilidae species. 84 

Previous molecular analyses of sexual antagonistic coevolu- 
tion have naturally focused on the unique complement of secreted 
proteins of the male accessory gland and, more recently, genes 
upregulated within the female reproductive tract. Prominent 
among these are a diverse set of proteolytic and immunity genes, 
many of which have been demonstrated to evolve rapidly. 14,78,79,85 
Analysis of both the Drosophila and mouse sperm proteomes 
reveal that these functional classes are abundant in sperm and, in 
the case of the mouse, that many of these proteins are localized 
to the cell membrane or acrosome and evolve rapidly. 58 Although 
the true abundance of immunity protein in sperm has only 
recently been revealed through proteomic analysis, it has long 
been recognized that male immunity proteins may be influential 
in mediating female immune responses post-mating. 86 Despite 
the observation that sperm induce a range of gene expression 
changes in the female reproductive tract, including some immu- 
nity genes, almost nothing is known about the function of the 
innate immunity or complement system in Drosophila sperm. 
Given the central role of serine proteases in the Toll pathway and 
their presence within both male and female reproductive sys- 
tems, the sperm proteome provides a foundation for exploring the 
functional coevolution of immunity and proteolytic pathways in 
reproduction, particularly those related to male-female interac- 
tions influencing sperm utilization and fertilization. 
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